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Abstract 

Address autoconfiguration is an important mechanism required to set the IP address of a node automatically in a wireless 
network. The address autoconfiguration, also known as initialization or naming, consists to give a unique identifier ranging 
from 1 to n for a set of n indistinguishable nodes. We consider a wireless network where n nodes (processors) are randomly 
thrown in a square X, uniformly and independently. We assume that the network is synchronous and two nodes are able to 
communicate if they are within distance at most of r of each other (r is the transmitting/receiving range). The model of this 
paper concerns nodes without the collision detection ability: if two or more neighbors of a processor u transmit concurrently 
at the same time, then u would not receive either messages. We suppose also that nodes know neither the topology of the 
network nor the number of nodes in the network. Moreover, they start indistinguishable, anonymous and unnamed. 

Under this extremal scenario, we design and analyze a fully distributed protocol to achieve the initialization task for 
a wireless multihop network of n nodes uniformly scattered in a square X. We show how the transmitting range of the 
deployed stations can affect the typical characteristics such as the degrees and the diameter of the network. By allowing 
the nodes to transmit at a range r = y ^^'^ (slightly greater than the one required to have a connected network), 

we show how to design a randomized protocol running in expected time 0(n^/^ log^ n) in order to assign a unique number 
ranging from 1 to n to each of the n participating nodes. 
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I. Introduction 

Distributed, multihop wireless networks, such as ad hoc networks, sensor networks or radio networks, 
are gaining in importance as subject of research [28]. Here, a network is a collection of transmitter-receiver 
devices, referred to as nodes (stations or processors) . 

Wireless multihop networks are formed by a group of nodes that can communicate with each other over 
a wireless channel. Nodes or processors come without ready-made links and without centralized controller. 
The network formed by these processors can be modeled by its reachability graph in which the existence 
of a directed edge u ^ v means that v can be reached from u. If the power of all transmitters/receivers 
is the same, the underlying reachability graph is symmetric. As opposed to traditional networks, wireless 
networks are often composed of nodes whose number can be several orders of magnitude higher than the 
nodes in conventional networks [1]. Sensor nodes are often deployed inside a phenomenon. Therefore, 
the positions of these nodes need not be engineered or pre-determined. This allows random and rapid 
deployment in inaccessible terrains and suit well the specific needs to disaster-relief, law enforcement, 
collaborative computing and other special purpose applications. 

As customary, the time is assumed to be slotted and nodes can send messages in synchronous rounds 
or time slots. In each round, every node can act either as a transmitter or as receiver. A node u acting 
as receiver in a given round gets a message, if and only if, exactly one of its neighbors transmits in the 
same round. If more than two neighbors of u transmit simultaneously, u receive nothing. That is, the 
considered networks do not have the ability to distinguish between absence of message and collision or 
conflict. This assumption is motivated by the fact that in many real- life situations, the (small) devices 
in used do not always have the collision detection ability. Moreover, even if such detection mechanism is 
present, it may be of limited value especially in the presence of some noisy channels. Therefore, it is highly 
desirable to design protocols working independently of the existence/absence of any collision detection 
mechanisms. 

We consider that a set of n nodes are initially homogeneously scattered in a square X of size \X\. As in 
several applications, the users of the network can move, and therefore the topology is unstable. For this 
reason, it is desirable for the protocols to refrain from assumptions about the network topology, or about 
the information that processors have concerning the topology. In this work, we assume that none of the 
processors have initially any topological information, except the measure (surface) \X\ of the square X 
where they are randomly dropped. We pinpoint here that even if \X\ is exactly known but not n then 
even if n = 0(|X|), equation such as (7) in the theorem 2 (see below) allows us to handle the subtle 
changements involved in the constant hidden by the "big-Oh" between n and 0(|X|). 

Self- configurations of networking devices appear to be one of the most important challenges in wireless 
and mobile computing. Before networking, each node must have a unique identifier (referred to as ID or 
address) and it is highly desirable to have self-configuration protocols for the nodes. A mechanism that 
allows the network to create a unique address (ID) automatically for each of its participating nodes is 
known as the address autoconfiguration protocol. In this work, our nodes are initially indistinguishable. 
This assumption arises naturally since it may be either difficult or impossible to get interface serial numbers 
while on missions. Thus, the IDs self-configuration protocols do not have to rely on the existence of serial 



3 



numbers. 

The problem we address here is then to design a fully distributed protocol for the address autoconfigu- 
ration problem (also known as initialization [22] or naming problem). By distributed protocol, we mean 
without the need of any preexisting centralized controller or base station, or requiring human interventions 
(network administrators). 

To this end, we remark first that the transmitting range of each station can be set to some value r 
ranging from to Tmax- This model is commonly used in mobile computing and radio networking [5], 
[15], [18], [19]. Note that such model is frequently encountered in many domains ranging from statistical 
physics to epidemiology (see for example [12] for the theory of coverage processes or [17] for percolative 
ingredients). The random graphs generated this way have been considered first in the seminal paper of 
Gilbert [10] (almost at the same time Erdos and Rcnyi considered the G{n,p) model [8]) and analysis of 
their properties such as connectivity and coverage have been the subject of intense studies [11], [20], [24], 
[25] , [26] , [27] . It is easy to see that if the transmitting range of the devices is set to 0, the underlying graph 
has no edges. If the transmitting range is too large, the graph is extremely dense making the scheduling 
of communications difficult. The Figure 1 below shows devices which have been deployed on some field 
in a random fashion. The examples of the figure suggest that transmission ranges can play a crucial 
role when setting protocols at least for randomly distributed nodes. Other important parameters are the 
number n of active stations, the shape of the area X where the nodes are scattered and the nature of the 
communications to be established. For instance, in [11] the authors considered a set of n nodes and a disk 
of unit area. In this case, if the range of transmission of the stations is set to a value r = r{n) satisfying 
-Kr{n)'^ = (log n + w(n))/n, it was shown by Gupta and Kumar that the wireless network is asymptotically 
almost surely (a.a.s., for short) connected if and only if uj{n) tends to oo with n. Throughout this paper, 
an event f„ is said to occur asymptotically almost surely if and only if the probability Pr [£„] tends to 1 
as n — > oo. We also say f„ occurs with high probability (w.h.p. for short). 




Fig. 1 

A TYPICAL RADIO NETWORK IS GENERATED VIA UNIFORM DISTRIBUTION OF THE X AND y COORDINATES OF THE DEVICES AND 
THE TRANSMISSION RANGES OF THE NODES ARE INCREASED GRADUALLY (fROM LEFT TO RIGHT). THE TWO LAST PICTURES 
SHOW THAT IF THE OBTAINED GRAPH HAS MORE EDGES THAN NEEDED, THE NUMBER OF COLLIDING PACKETS IS MORE 

DIFFICULT TO CONTROL. 

According to these observations, to design efficient protocols, we have to take into account and to exploit 
the structural properties of the reachability graph. In our scenario, since none of the nodes knows the 
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number n of the processors in the network, our first task is to find distributed protocols that allow 
(probabilistic) counting of these nodes. We then go on to show that by setting the transmitting range 
parameter correctly, the network can be auto-initialized in expected average 0{n^^^ In^ n) time slots. 
As far as we know, this is the first analysis for the initialization protocols in the multihop cases (the 
single-hop cases have been treated in the litterature in [13], [22], [23], [30]). Our algorithms are shown 
to take advantage of the fundamental characteristics of the network. These limits are computed with 
the help of fully distributed protocols and once known, a divide- and- conquer algorithm is run to assign 
each of the n processors a distinct ID number in the range from 1 to n. Even though the protocols are 
probabilistic, once the IDs arc attributed their uniqueness can be checked deterministically by for example 
the use of deterministic algorithms such as those of Chrobak, Gasicnicc and Rytter in [6]. As a result, the 
combination of both protocols leads to an initialization protocol which always succeeds and only whose 
running time is random. 

Under the conditions described above, the Figures 2 and 3 summarize briefly the input and output of 
the distributed protocols presented in this work. 



Input: 



Output: 





Fig. 2 

n INDISTINGUISHABLE AND IDENTICAL PROCESSORS 
RANDOMLY PLACED IN THE SQUARE X. THE ONLY 
KNOWLEDGE RECQUIRED IS THE SIZE \X\ OF THE SUPPORT. 



Fig. 3 

Each of the n processors is assigned a unique ID 
ranging from 1 to n. these ids can serve as ip 
address. 



The remainder of this paper is organized as follows. Section 2 first presents a randomized protocol 
SEND which plays a central key role throughout this paper. We then analyze this protocol. In Section 3, 
we discuss about how to set correctly the transmission range of the nodes. Section 3 also offers results 
about the relationship between the transmission range r, the number of active nodes n, the size of X, the 
maximum degree A and the hop-diameter D of the wireless networks. These results and the use of the 
procedure SEND allow us to build a BROADCAST protocol. Section 3 ends with the design and analysis of a 
protocol called SFR which plays a central key role to check the correct transmission range. In Section 4, we 
briefly recall how to initialize n processors in the case of single- hop network (i.e., whenever the underlying 
reachability graph is complete). Using the BROADCAST protocol, we then turn on the design and analysis 
of a randomized address autoconfiguration protocol in the case of wireless multihop networks. 
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II. The basic protocol for sending information 



First of all, no deterministic protocol can work correctly in the networks when processors are anony- 
mous. This can be easily checked: conflict between two processors absolutely identical can not be solved 
deterministically. Therefore, this impossibility result implies the use of randomness (see [3]). Since our 
processors do not have unique identifiers, our first task is to build a basic protocol for the nodes which 
compete to access the unique channel of communication in order to send a given message. This can be 
achieved by organizing a flipping coin game between them. Recall also that if the transmission/receiving 
range is set to a value r, only neighbors of distance at most r are able to communicate when conflicts 
are absent. Thus, in the following procedure we have to take into account this parameter as well as the 
duration T of the trials: 

Procedure SENT){msg, T, r) 
For i from to T do 

With probability 1/2* send msg to every neighbor 

(★ processor within distance at most r ★) 

end. 



Note that r is here a parameter which can be tuned to a precise value. Again, it should be clear now that 
only neighbors within distance at most r can receive the message when there is no conflict. We have the 
following result: 

Theorem 1: Denote by Tcon the transmission range required to get a connected reachability graph. Let 
r > ''con be the current transmission range of the processors. For a flxed node v, denote by dy its degree 
with dy = dv{r). Suppose that each of the dy neighbors of v start the execution of SEMD(ms5, T, r) at the 
same time. Let P{T, dy) be the probability that v will receive the message msg at least once between the 
time t = and i = T. Then P(T, d^) satisfies: 

(i) the hmit limr^oo P{T,dy) exists 

(ii) if ^ 00 but 2^ ;» dy then 



Proof. The assumption that the reachability graph is connected insures that for all processor v, the 
degree of v verifies dy > 0. To prove (i), it suffices to observe that {P{T,dy))T is an increasing sequence 
bounded by 1 so that it converges. For (ii), we have 




(1) 



□ 





Denote by St{d) the quantity: 




(3) 
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For any given ii and for all i > ii, we have 




Therefore, if t — > oo such that 2* d by choosing ii = [| log2 d\ we obtain 

s <»p -Es;E|=»p(-f^(i + F-)) 

y m>l i=ii ^ ' y 

< .1884 + 0(i) +0(i). 

We used the so-called McUin transform asymptotics detailed in [9] and in [16, p. 131]. The value 
exp ^— m!/(m™+^ log 2)^ = .188209... has been numerically computed with Maple. Note that 
similar computations show that St{d) > .1883 + 0{l/d) + 0(d/2*) but the above inequality suffices to 
obtain (2). ■ 

Remark 1. Denote by T* the duration of the coin flipping game between d nodes needed to succeed to 
send a message to a common neighbor. Theorem 1 asserts that for a suitable value of the transmission 
range r such that the graph is connected, T* — T*(d) is in probability at most log2 d + 0(1). That is the 
probability Pr{T* > log2 d + io{d)) — > for any arbitrary function cj{d) tending to infinity with d. This 
can be checked by standard probabilistic arguments since as soon as 2^ d, the probability of confiict is 
less than 0.188... . 

Remark 2. In [3], Bar- Yehuda et al. have designed a randomized procedure called DECAY to send 
information. They have shown that if d neighbors of a node v execute DECAY simultaneously, then after T 
time slots with T > 21og2(rf) and with probability greater than 0.5, the node v receives a message [3, pp 
108-109]. In our procedure SEND, the proof of theorem 1 (see also [9]) shows that by changing the basis 
of the coin ffipping game, viz. replacing the probability 1/2* in the algorithm by 1/a* for any constant 
a > 1, the probability of success of the T trials can be made arbitrary close to 1 (after similar logarithmic 
number of time slots satisfying ^ d). 

In the next Section, we turn on the problem of finding suitable values of transmission range whenever 
the only a priori knowledge of the processors is the size of the square X. 

111. Transmission ranges and characteristics of the network 

The aim of this Section is to provide randomized distributed algorithms that allow the nodes in the 
network to find the right transmission range such that the reachability graph is at least connected. To 
this end, we need to know the relationships between the transmission range r, the number of processors n 
and the measure \X\ of the square. Other fundamental characteristics of the graph, such as the minimum 
(resp. maximum) degree S (rcsp. A) and the hop-diameter D are also of great interest when designing 
wireless protocols (see [3]). Moreover, the limits of the randomly generated network of processors help 
as it will be shown shortly. We refer here to [10], [11], [20], [27], [32], [33] for works (under various 
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assumptions) related to the fundamental characteristics and limits of random plane networks. For sake of 
clarity, we treat in this Section two distinct paragraphs: 

• The first one concerns the characteristics of the reachability graph in the superconnectivity regime, i.e. 
when the radius of transmission of the nodes grows much more faster than the one required to achieve 
the connectivity of the graph. 

• The second subsection is devoted to the design and analysis of a distributed protocol, called SFR, that 
will allow the nodes to approximate their number. At this stage, the technical characteristics related to 
fundamental limits of the graph relating r, n, \X\, D, A described previously, become extremely important. 

A. Fundamental limits of a random network in the superconnectivity regime 

For several reasons, we follow here the Miles's model [20]. In this model, a large number n of devices 
are dropped in some area X. As n ^ oo but n = 0{\X\), the graph generated by the transmitting 
devices can be well approximated with a Poisson point process (see for instance [12]). First of all, this 
extreme independance property allows penetrating analysis. Next, since Poisson processes are invariant 
if their points are independently translated (the translations being identically distributed following some 
bivariate distribution), the results can take their importance for moving nodes and therefore, they are 
well suited to cope with randomly deployed mobile devices. Third, due to Poisson processes properties, 
if with probability p, such that p x n — 0{\X\), some nodes arc faulty or intentionally asleep (e.g. to 
save batteries to design energy-efficient algorithms), our results remain valid. In this latter scenario, the 
number of nodes n is simply replaced by n' = p n. 

Among other results, Penrose [26] proved (with our notations) that if n/jXj = 0(1) and if Toon denotes 
the minimal radius of transmission to achieve connectivity then 

= exp(-e-'^), wGK. (5) 

Penrose's result tells us that by letting the radius of transmission range growing as 

/ Inn + ^n) ~ 
V nn 

for any arbitrary function of oj{n) tending to infinity with n, the obtained graph of the network is a.a.s. 
connected. 

For our purpose, we need the following results related to the degrees of the nodes participating in the 
network according to successive values of the transmission range: 

Theorem 2: Denote by r the transmission range of the n nodes randomly distributed in the square X 
of size \X\ = 0{n). Then, in the following regimes with high probability the graph is connected and we 

have: 

(i) For fixed values of k, that is fc = 0(1), if ^j^''"'^ = l^"' + fclnlnn + w(n), then the graph has a.a.s. a 
minimum degree of (5 = fc. 

(ii) Let k = k{n) but 1 <C fc(n) <C Inn/ In Inn. If TTj^r^ = Inn + fc(n) In Inn, then the minimum degree 
(resp. maximum degree) is a.a.s. d = k{n) (resp. A = e Inn). 



lim Pr 

n— ^cx) 



n 



8 



(Hi) If TTr^r^ = (1 + Inn with £> then each node v of the graph has a.a.s. dy neighbors with 



Inn 



+ o (In n) < dv < 



£ Inn 



o (In n) , 



(7) 



where W-i and Wq denote the two branches of the Lambert W function^ which are detailed in [7]. See 
also the appendix of this paper for some details about the two branches of the Lambert W function. 
Each geographical point of the support X is also recovered by 9 (Inn) disks of transmission in the case 
TTifyr^ = (l+^)lnn. □ 
Proof. We refer to [30] where asymptotic coverage as well as connectivity properties are treated in 
details for the ranges of transmission considered in the theorem 2. ■ 

Remark 3. The theorem 2 above remains valid for any bounded surface X. Thus, it answers a conjecture 

of the authors of [29]. 

Next, wc derive an upper-bound of the hop-diamctcr D in the supcrconncctivity regime: 

Theorem 3: Define by I? = D{r) the hop-diamctcr of the graph which is a priori a function of r the 
transmission range. D is the maximum number of hops required to travel from any node u to another 
node V of the network. Suppose that the transmission range satisfies r = \J '^^^^n^ — \X\ with £ > 0. We 



then have: 

(i) If £ > |Ei then 



(ii) If £ < fEi then 



lim Pr 

n— ^oo 



lim Pr 

n— ^oo 



£)< 3 



£)< 5 



(1 



Inj 



7rn 



{l + f) Inn 



0(1) 



+ 0(1) 



1. 



1. 



(8) 



(9) 



□ 



Proof. Split the square X into p equal subsquares Si, S2, ■ ■ ■ , Sj2. Each of the subsquares has a side 
^/\X\lj and an area Choose j such that each subsquare Si can entirely a circle of radius equals 

to r as depicted below. 



Subdivision of X That is = r = . So, 3 = 

2,j y 7rn ' ^ 



Size \X\/f 



{i+l)\nn - ^^"^ ^^^'^ °f simplicity 
but w.l.o.g., we suppose that j G N. By the theorem 2 (iii) given above, with 
high probability we have 6 (Inn) nodes inside the circle. Any pair of processors 
inside the same circle need at most 2 hops to be connected since they are 
at distance at most 2r and since each subgraph inside such a circle is a.a.s. 
connected. 



We claim that from two adjacent subsquares Si and S2, communications between any node a € Si and 

any node 6 G 6*2 need at most (w.h.p.) : 

a) 6 hops for £ > |ei = 0.7519... and 

^The Lambert W function is considered as a special function of mathematics on its own and its computation has been 
implemented in mathematical software such as Maple. 
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b) 10 hops for I < 

To prove the first part, viz. a), consider adjacent subsquares as follows : 



At most 6 HOPS 





L / \ 

\ 1 


1 > 




1 >r ^ 



1 


/ \ 

> 1 < 1 


/ \ \ 


/ 



At most 

10 HOPS 


\ \ 


y X 

4 / \ 

\ 1 


t \/ t 



Fig. 4 

Horizontal transmission. 



Fig. 5 

Diagonal transmission. 



Fig. 6 

"Undirect" transmission. 



A bit of trigonometry shows that each lens-shaped region such as L\ has a surface of exactly \L\\ = 
g(47r — 3-\/3)r^. Note that Li represents the intersection of two disks of equal radius r whose centers are 
at distance r. Therefore, there is no node inside the lens-shaped region Li with probability 



(l - = (l - 1(4. - 3^/3)^i±^)" < exp (-1(4. - 3^/3)(l +f)n) . 



(10) 



Since each subsquare has at most 4 lenses of size |Li|, none of these regions is empty with probability at 
least 



1 — exp 



1(4.- 3V3)(1 -F£)n 



(4x/) 



1 



1 



„i(47r-3\/3)(l+^) 



> exp -2 X 



-i(47r-3\/3)(l+£)\ 



(11) 



(l-f^) inn 

Hence, with probability tending to 1 as n ^ oo, in every lens-shaped region of size \Li\ there is at least 
a node. Thus, to transmit message between two horizontally (or vertically) adjacent subsquares, we need 
at most 6 hops (see Figure 4). 

To prove b), we consider lenses such as L2 depicted in Figure 5. The size of such region is IZ/2I = ^^-^f'^ 
which measures the area of the intersection of two equal disks of radius r and at distance \/2 x r. Arguing 
as for (11), we find that for every lens of size \L2\ to be non-empty (w.h.p.) we need that (1 + 1)^^^ > 1. 



This condition is only satisfied ii £ > 



7r-2 



- 1 



0.7519.... For values of £ < 0.7519..., transmissions are 



sent horizontally then vertically (or vice- versa). Such transmissions can required up to 10 hops (cf. Figure 
6). The proof of the theorem is now easily completed by simple counting arguments. ■ 

X\. It is obvious that 



Remark 4- In theorem 3, we consider transmitting ranges of the form y ^^^^i'"" 
if the transmitting range augments, the hop-diameter diminishes. For instance, for values of transmitting 
range satisfying .p^r^ = Lu{n) Inn, with ui{n) :s> 1, a.a.s. D < 3 \J We refer the reader to the paper 
[33] where the authors obtain similar results (with the notations of our paper, they obtained upper-bounds 
for r > sj^^^^)- We also remark here that our theorem 2 yields an immediate lower-bound of D to 
finally show that if r satisfies r > (1 + ^) Tcon for any £>Q then w.h.p. D = ^{^/^^)■ 
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B. Adjusting the transmission range and fundamental limits 

The previous paragraph gives us almost sure characteristics of the network but we need to verify and 
to exchange these informations by means of distributed protocols. To this end, we need two procedures. 
The first one is the BROADCAST protocol. In this protocol, some processors (called sources) try to diffuse a 
given message to all the nodes in the network. It makes several calls of SEND. The second procedure SFR 
(for "search-for-range" ) is used to adjust the correct transmission range of the nodes in order to "take 
control" of the main characteristics of the network. It works as follows. 

Each processor starts with the maximum range of transmission. Then at each step, the transmission 
range is diminished gradually until the deconnexion of some of the nodes. At this stage, these isolated 
nodes readjust their transmission range (in order to be re-connected) and make call to BROADCAST to send 
a message of "deconnexion" to all the processors in the network. A processor quits the protocol if and 
only if cither it has been isolated once, is reconnected and has sent the "deconnexion" message or it has 
received the "deconnexion" message containing information about the adequate transmission range. 

Now for details, we start with the BROADCAST protocol. The procedure is similar to the one in [3] except 
that we use SEND to transmit messages. 



Procedure BR0ADCAST(ms5, e, A, r, A'^) 
k := 2[log2 A] 

T := riog2 (A/£)l 

Wait until receiving a message msg 
For i from 1 to r do 

Wait until TIME mod k = l 

SEm{msg, k, r) 

end. 



(★ A is an upper-bound of the maximum degree ★) 
(★ N is an upper-bound of the number of nodes ★) 

(★ to synchronize *) 
(★ attempt to send msg *) 



In the procedure above, s > can be made arbitrarily small. A is a parameter representing the maximum 

degree of the network (or an upper bound of the maximum degree. This can be computed for a given 
value of the transmission range using theorem 2). A is an upper-bound of the number of participating 
nodes. TIME is a protocol which allows a given node to have the current time. Following the proves of [3, 

Theorem 4], we have: 

Theorem 4: Bar- Yehuda, Goldreich, Itai [3]. Suppose that r is the actual transmission range of the 
nodes. Assume that A (resp. A) is an upper-bound of the maximum degree (resp. the number of nodes) 
in the network and let T = 2D + 5x max {VD, y^log2 (A/e)) x ^/\og2 (A/e). Assume that some initiators 
start the procedure BR0ADCAST(ms5f, e, A, r, A) at TIME = 0. Then, with probability > 1 - 2e by time 
2 [log2 A] T, all the nodes receive the message. Furthermore, with probability > 1 — 2£ all the nodes have 
terminated by time 2 [logs ^1 {T + (A/e)] ) . □ 

Remark 5. Note that in the procedure BROADCAST above, we substitute the DECAY in [3] by SEND since 
our protocol SEND seems to be more efficient than DECAY (cf. theorem 1 in this paper and [3, theorem 1]). 
However, this would only affect by a constant factor the time needed to accomplish a complete broadcast 

in the network as given above. 

We need to have as fast as possible bounds of the value of the number n of the processors. If po = Llog2 
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then 2P0 < n < 2p«+i. Thus, by setting R{2p) := y^ (i"(2^)+2to 2)_[x| ^j^^^ ^ increases, R{2p) 

decreases. In the protocol SFR, we increment p one by one, starting at a value close to the maximal 
transmission range of the processors. When p passes through po ^ 1; Po ^-nd Po + 1) there will be w.h.p. 
some nodes which become isolated. In fact, a bit of calculus shows that 

/ ^Inn^^ 1 (12) 

V Trn 

We are now ready to give the protocol SFR. The procedure SFR is executed by each station and the details 
follow : 
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LO) Procedure SFR(£) 
LI) BEGIN 

L2) R:=x^ 

B -.^x 



(2'')+21n2) \X\ 



P ■ = 



L3) 

L4) 

L5) 

L6) 
L7) 

L8) 

L9) 
LIO) 
Lll) 
L12) 
L13) 
L14) 
L15) 

L16) 

L17) 
L18) 
L19) 
L20) 

L21) 

L22) 
L23) 
L24) 
L25) 
L26) 
L27) 
L28^ END. 
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In a; X ( + x - logs i^)) 



1" (2/e) 
In 5 



DECONNECTED := false ; 

loga (r-MAx) 
REPEAT 

counter := 0; 

t := TOa.T(4 j" log2 (p) , 
For i from 1 to t Do 

SEND("p",«, R{p)); 

If receiving a message "p" Then 

counter := counter + 1; 
Endlf 
EndFor 

If counter = Then 



(* "Search-For-Range" ★) 

(★ Set -R as a local function: R = R{x) ★) 

(* Similarly, the "broadcast time": B = B{x) *) 
{■k Flag for isolated nodes *) 
(* Set the initial value of p to the maximum -k) 

(★ counter for each station*) 



(★ Succession of i trials ★) 



[-k The considered station received no message ★) 



ln(l/2£) 
ln(2e) 



Do 



For j from 1 to 

BROADCAST("Deconnexionp",£, 3p, R{p - 1), 2^+^) ; 
EndFor 

DECONNECTED := 



true ; 



Else 



Wait for a message for 



ln(l/2£) 
ln(2e) 



X B[p — 1) times ; 

(* Give sufficient time to the advertisements of possible isolated stations -k) 
If receiving the dcconncxion message Then 

Scan the value of p and set DECONNECTED := true; 
Else p:=p+l ; 
Endlf 
Endlf 

UNTIL DECONNECTED = true ; 



When reaching the value of pq, the isolated nodes - whose transmission ranges are now set to r = R{2p° ) 
- can increase back their transmission range, viz. i?(2^'''~^), in order to be reconnected. Next, these 
processors have to advert the others about upper-bounds of n, A and D, respectively given by 



-Wo(-e-V2) 



Inn < 3po and < 5. 



7r2P" 



ln2(po + l) 



< 12 



2P0 
Po 



(13) 



where we used theorems 2 and 3 for A and D. The advertisements can be made correctly by means of 
multiple uses of the protocol BROADCAST but we have to give sufficient time slots cf. (L21) to the 
broadcasting processors in order to let the others be aware of the bounds given by (13). The message sent 
for these advertisements is represented by a special message, say "Deconnexion po" which contains the 
right value of po- Taking into account (13), we remark that the "broadcast time" given by the theorem 
4 is (with probability greater than 1 — 2e) less than 2[log2 A] x {2D + 5 x max{\/D, ^/[og^^JWfe)) x 
^log2 (iV/e) + [log2(Ar/£)]). This is strictly less than 241n(po) x [yJ^ + Po - logs (s))- The protocol 
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SFR has the following properties: 

Theorem 5: Suppose that the random deployed network is an instance that has exactly the upper- 
bounds given by (13), that is the main characteristics n, A and D of the input graph satisfy (13) with 
probability 1. For any c > there exist a constant ci > such that with probability at least 1 — the 
protocol SFR(^) terminates in at most 0(lnlnn ) time slots. After this time, with probability at 

least 1 — every node is aware of the upper-bounds of values of n, A and D. □ 
Proof. Due to place limitation, we will just discuss the main lines of the proof. In lines (L9)-(L14), 
the inner loop is repeated t times. Consider a random picked node v. By theorem 1, as soon as i (cf. line 
(L9)) satisfies 2* » dy {dy represents the degree of v), the probability of success of each call of SEND is at 
least O.8.... By theorem 2, and under the hypothesis that the graph satisfies the almost sure properties of 
a random network, if p = po ~ Llog2 (^)J i < 3po. Therefore, by setting t as in line (L8), we insure that 
if the node v is still connected, it will receive more than one message from its neighbors with probability 
at least 1 — §• Similarly, by repeating sufficient calls of BROADCAST for the just deconnected nodes (see 
the discussion above) and give sufficient time to them to send the message of deconnexion to the others, 
we give sufficient chance to the processors of the whole network to know the correct upper-bounds of n 
(and thus A and D). To explain the constants c and ci involved in the result, one can always choose e of 
the form e = in order to obtain probabilities of failure of order l/n'^. ■ 

Despite theorem 5 is given with the assumption that the input network satisfies (13), we believe that SFR 
with slight modifications can handle the other cases (which indeed occur extremely rarely). For instance, 
the line (L17) can be modified in this sense and one can give the upper-bounds in (13) greater values (e.g. 
by augmenting the actual values). 

IV. The initialization protocol 

We have settle in the previous paragraph the problem of determining the correct transmission range for 
the nodes of a random network to have the characteristics (mainly maximum degree and hop-diameter) 
dieted by theorems 2 and 3. We also know through the protocol SFR a probabilistic upper-bound of the 
number of participating nodes. In [2], Bar- Yehuda et al. gave protocols for efficient emulation of a single- 
hop network with collision detection on multi-hop radio network, provided that the number of nodes, the 
diameter and the maximum degree of the network (or upper-bounds of them) are known. Combination 
of their results with ours lead to a new initialization protocol. In the next paragraph, we will first give 
the protocol for the single-hop network without collision detection and then extend it to our purpose by 
means of the emulation protocols given in [2] . 

A. Initialization in the single-hop model. 

In the single- hop model of networks [22], we have direct links between any pair of processors. First, we 
give a procedure that can randomly split a set S of directly connected processors, into many subsets, say 
'S'l, 5*2, • • • , Sk, such that at least two of the subsets are non-empty. In this subsection, we consider that 
the processor has the collision-detection ability. This assumption doesn't change our final results thanks 
to the emulation protocols given in [2] . The following procedure EQUIPARTITION is an implementation of 
Bernoulli process in order to partition a given initial set into k subsets. The process is repeated until the 



14 



original set is partionned into at least two nonnuU subsets. Note that, the "non-empty status" can be 
checked by the nodes since it happens if and only if either there is collision between two or more messages 
or there is exactly one station that has put a message on the channel. 

Remark 6. It is important to remark here that these protocols are originally due to Hayashi, Nakano 
and Olariu [13] (see also [22]) but we put them here for sake of completeness. 

Procedure EC!UIPARTITION(Input: S, k , Var: Number; Output: ^i, S2, ■ ■ ■ , Sk) 
Repeat 

each station selects randomly an integer i E {1, 2, • • • ,k} 
to join Si and broadcasts when TIME mod k = i ; 
Until at least two of the subsets Si, i e [1, k] are non-empty 
For i from 1 to fc Do 
If \Si\ = 1 Then 

the unique station is labeled with N and quits the protocol; 
Number := Number + 1; 

Endlf 
EndFor 
END. 

Therefore, in EQUIPARTITION the test "at least two of the subsets Si,i E [l,fc] arc non-empty" for the 
repeat loop can easily done in the single-hop case with collision detection (in fact, stations can distinguish 
between the lack of message, the presence of exactly one message and collisions). Consider now a group of 
m stations, the probability of failure of the splitting process applied on this subset is Pr [failure] = 
So, with probability 1 — EQUIPARTITION subdivides the inital set of m stations into at least 2 

subsets. For i e [1, k], if \Si\ = 1, that is a processor is alone in a subset, it can be labeled. The labeling 
of such processor is carried out with the variable Number that is incremented each time a station leaves 
the protocol. Tlie procedure INITIALIZATION is tlien called. The details are given as follows: 

Procedure INITIALIZATION(Input: S, k) 

For each station, set Number := 1 and L := 1; 
Initialize Pl ■— { all stations }; 
While L > 1 Do 
Switch \Pl\ 

Case 0; Case 1: 
L-1; 

Default: 

Partition Pl into Pl, Pl+i, •-, Ph+k-i 
with EQUIPARTITION(Pi„ Number, k); 
L:=L + k-l; 

EndWhile 
End. 

The protocol INITIALIZATION has the following property: 
Theorem 6: Let S* be a group of n nodes communicating directly via a single channel. 
INITIALIZATI0N(S', 3) always terminates (with probability 1) and its running time is in average approxi- 
mately j^n time slots. □ 
Proof. The proof of this theorem relies on Mellin transform asymptotics [16], [9] and is omitted. We 
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refer also to the papers [13], [22] for randomized protocols in the single-hop cases and to [21] for the 
average-case analysis of such algorithms. ■ 

B. Initialization of the wireless multihop network 

We turn now on a more general self-configuration protocol designed for wireless multihop network. In the 
light of the previous paragraphs, one can design an initialization protocol as follows : 
Step 1 ) Use SFR for determine upper-bounds of n, A and D, 

Step 2) Emulate the protocol INITIALIZATION for single-hop networks described above in order to give 
IDs for the nodes of the wireless multihop network. To this purpose, a natural idea is to repeat the number 
of tests to check how many stations are actually broadcasting. This can be done by means of the protocol 
that allows the emulation of collision detection given in [2] . By theorem 6 and since each broadcast costs 
0{^/n Inn) time slots, the initialization of a multihop wireless network can be done in subquadratic time 
slots, viz. 0(\/n^ Inn). With extremely small probability, there will be duplicate IDs. These errors are 
rare and can be checked once the nodes are "identified" with the help of the deterministic distributed 
0{r\^l'^ In^ n) gossiping protocol given in [6]. 

Step 3) In the presence of failures, reiterate the process by augmenting the values given in Step 1) and 
go directly to Step 2). All together, combinations of these algorithms lead to an initialization protocol 
which always terminates in expected time 0{n'^^^ In^ n). 

V. Conclusion 

We showed that given a randomly distributed wireless nodes with density n/\X\, when the transmission 
range of the nodes is set to r = \J 0-+ ^) ^"^i^^ '■ (i) the hop-diameter is less than 5y^ ^-^_^^"j ^, (ii) 
the network is 0(lnn)-connected, each point of the support area is monitored by 8(lnn) nodes and the 
degrees of all nodes are 0(lnn), with high probability. We showed how these results can help to conduct 
precise analysis in order to design protocols for the self-configuration of the network. The protocols of 
this paper are fully distributed and assume only as a priori knowledge of the nodes the size of the support 
area X. These results illustrate how fundamental limits of random networks can help researchers and 
developpers for the design of algorithms in the extremal scenarios and the protocols given in this paper 
can serve as basis for other decentralized algorithms. 

As a final comment, it is important to note that for the single-hop cases, energy- efficient protocols have 
been designed by Nakano and Olariu [23] . Their works naturally suggest a generalization for the multihop 
cases under various scenarios. 
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Appendix 



The Lambert W function. In this paragraph, we give some properties of the function satisfying 
We remark here that the function W, in particular the principal branch Wa, already 
plays a central key role when studying the random graph model G(n, m), i.e., the random graph built with 
n vertices and m edges which is the "enumerative counterpart" of the G(n,p) random graph model (see, 
e.g., the "giant paper" [14]). In fact, —Wo{—x) is the exponential generating function that enumerates 
Cayley's rooted trees [4] and we have 



We plot in Figure 8 the two real branches of the Lambert W function considered in this paper. This 
function has been recognized as solutions of many problems in various fields of mathematics, physics and 
engineering as emphasized in [7]. The Lambert W is considered as a special function of mathematics 
on its own and its computation has been implemented in mathematical software as Maple. Figure 8 
represents the two real branches of the Lambert W function. It is shown that the two branches meet 
at point M = (— 1/e, —1). As an example, if £ = | in (7), each point of the area X is covered, with 
high probability, by at least ~ .1520088850 Inn disks. We have the Figure 7 depicting the function 
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Fig. 8 

The branches Wo (dashed line) and W-\ 
(solid line) of the Lambert W function. 
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